Opportunistic Scheduling as Restless Bandits

نویسندگان

Vivek S. Borkar

Gaurav S. Kasbekar

Sarath Pattathil

Priyesh Y. Shetty

چکیده

In this paper we consider energy efficient scheduling in a multiuser setting where each user has a finite sized queue and there is a cost associated with holding packets (jobs) in each queue (modeling the delay constraints). The packets of each user need to be sent over a common channel. The channel qualities seen by the users are time-varying and differ across users; also, the cost incurred, i.e., energy consumed, in packet transmission is a function of the channel quality. We pose the problem as an average cost Markov Decision Problem, and prove that this problem is Whittle Indexable. Based on this result, we propose an algorithm in which the Whittle index of each user is computed and the user who has the lowest value is selected for transmission. We evaluate the performance of this algorithm via simulations and show that it achieves a lower average cost than the Maximum Weight Scheduling and Weighted Fair Scheduling strategies.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Lazy Restless Bandits for Decision Making with Limited Observation Capability: Applications in Wireless Networks

In this work we formulate the problem of restless multi-armed bandits with cumulative feedback and partially observable states. We call these bandits as lazy restless bandits (LRB) as they are slow in action and allow multiple system state transitions during every decision interval. Rewards for each action are state dependent. The states of arms are hidden from the decision maker. The goal of t...

متن کامل

Marginal productivity index policies for scheduling restless bandits with switching penalties

We address the dynamic scheduling problem for discrete-state restless bandits, where sequence-independent setup penalties (costs or delays) are incurred when starting work on a project. We reformulate such problems as restless bandit problems without setup penalties, and then deploy the theory of marginal productivity indices (MPIs) and partial conservation laws (PCLs) we have introduced and de...

متن کامل

Restless Bandits, Partial Conservation Laws and Indexability

We show that if performance measures in a general stochastic scheduling problem satisfy partial conservation laws (PCL), which extend the generalized conservation laws (GCL) introduced by Bertsimas and Niño-Mora (1996), then the problem is solved optimally by a priority-index policy under a range of admissible linear performance objectives, with both this range and the optimal indices being det...

متن کامل

Behaviors Coordination Using Restless Bandits Allocation Indexes

In order to remain viable and to reproduce an animal has to continuously deal with the problem of choosing the right behavior among several others (e.g. obtaining food, obtaining water, avoiding predators,. . .) at the right time. In robotics this problem arises when we want to synthesize a complex behavior from elementary behaviors. Within the reinforcement learning framework we review the beh...

متن کامل

Optimal Policies for a Class of Restless Multiarmed Bandit Scheduling Problems with Applications to Sensor Management

Consider the Markov decision problems (MDPs) arising in the areas of intelligence, surveillance, and reconnaissance in which one selects among different targets for observation so as to track their position and classify them from noisy data [9], [10]; medicine in which one selects among different regimens to treat a patient [1]; and computer network security in which one selects different compu...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1706.09778 شماره

صفحات -

تاریخ انتشار 2017

Opportunistic Scheduling as Restless Bandits

نویسندگان

چکیده

منابع مشابه

Lazy Restless Bandits for Decision Making with Limited Observation Capability: Applications in Wireless Networks

Marginal productivity index policies for scheduling restless bandits with switching penalties

Restless Bandits, Partial Conservation Laws and Indexability

Behaviors Coordination Using Restless Bandits Allocation Indexes

Optimal Policies for a Class of Restless Multiarmed Bandit Scheduling Problems with Applications to Sensor Management

عنوان ژورنال:

اشتراک گذاری